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PROGNOSTIC CLASSIFICATION OF ENDOMETRIAL CANCER 

Related Applications 

This application claims the benefit under 35 U.S.C. § 1 19(e) of U.S. provisional 
application 60/221,735, filed July 31, 2000. 

Field of the Invention 

The invention relates to nucleic acid microarray markers for cancer, particularly for 
endometrial cancer. The invention also relates to methods for diagnosing cancer as well as 
optimizing cancer treatment strategies. 

Background of the Invention 

Endometrioid endometrial adenocarcinomas are the most common gynecologic 
malignancy, the risk of which is increased by an abnormal endocrine environment or 
premalignant lesions with loss of tumor suppressor function. The 6000 deaths yearly make 
uterine cancer the seventh leading cause of death from malignancy in females. It is primarily 
a disease of postmenopausal women, although 25 percent of cases occur in women below age 
50 and 5 percent below age 40 (Harrison's Principles of Internal Medicine 1998). 

Although much progress has been made toward understanding the biological basis of 
cancer and in its diagnosis and treatment, it is still one of the leading causes of death in the 
United States. Inherent difficulties in the diagnosis and treatment of cancer include among 
other things, the existence of many different subgroups of cancer and the concomitant 
variation in appropriate treatment strategies to maximize the likelihood of positive patient 
outcome. 

The prognosis of endometrial cancer depends upon stage, histologic grade, and extent 
of myometrial invasion. The staging of endometrial cancer requires surgery to establish the 
extent of disease and the depth of myometrial invasion. Peritoneal fluid should be sampled; 
the abdomen and pelvis explored; and pelvic and para-aortic lymphadenectomy performed 
depending upon the histology, grade, and depth of invasion in the uterine specimen on frozen 
section. 

Initial evaluation of patients suspected of endometrial cancer includes a history and 
physical and pelvic examination followed by an endometrial biopsy or a fractional dilation 
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and curettage. Outpatient procedures such as endometrial biopsy or aspiration curettage can 
be used but are definitive only when positive. Once a diagnosis is made, the options for 
treating endometrial cancer are assessed with respect to the needs of the patient. These 
options traditionally include surgical intervention, radiotherapy, chemotherapy, and adjuvant 
5 systemic therapies. Adjuvants may include but are not limited to chemotherapy, 
radiotherapy, and endocrine therapies with progestational agents such as 
hydroxyprogesterone, megastrol, and deoxyprogesterone, and the antiestrogen tamoxifen. 

It is difficult to predict from standard clinical and pathologic features the clinical 
course of endometrial cancer. However, it is very important in the treatment of endometrial 

10 cancer to select and implement an appropriate combination of therapeutic approaches. The 
available methods for designing strategies for treating endometrial cancer patients are 
complex and time consuming. The wide range of cancer subgroups and variations in disease 
progression limit the predictive ability of the healthcare professional. In addition, continuing 
development of novel treatment strategies and therapeutics will result in the addition of more 

15 variables to the already complex decision-making process involving matching the cancer 

patient with a treatment regimen that is appropriate and optimized for the cancer stage, extent 
of myometrial invasion, tumor growth rate, and other factors central to the individual 
patient's prognosis. Because of the critical importance of selecting appropriate treatment 
regimens for endometrial cancer patients, the development of guidelines for treatment 

20 selection is of key interest to those in the medical community and their patients. Thus, there 
presently is a need for objective, reproducible, and sensitive methods for predicting 
endometrial cancer patient outcome and selecting optimal treatment regimens. 

Summary of the Invention 

25 It now has been discovered that particular sets of genes are expressed differentially in 

normal and malignant endometrium. These sets of genes can be used to discriminate between 
normal and malignant endometrial tissues. Accordingly, diagnostic assays for classification 
of tumors, prediction of tumor outcome, selecting and monitoring treatment regimens, and 
monitoring tumor progression/regression can now be based on the expression of sets of 

30 genes. 

According to one aspect of the invention, methods for diagnosing endometrial cancer 
in a subject suspected of having endometrial cancer are provided. The methods include 
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obtaining from the subject an endometrial tissue sample and determining the expression of a 
set of nucleic acid molecules or expression products thereof in the endometrial tissue sample. 
The set of nucleic acid molecules includes at least two nucleic acid molecules selected from 
the group consisting of SEQ ID NOs: 1-50. In preferred embodiments, the endometrial tissue 
5 sample is suspected of being cancerous. 

In some embodiments the set of nucleic acid molecules includes more than 2, and up 
to all of the nucleic acid molecules set forth as SEQ ID NOs: 1-50, and any number of nucleic 
acid sequences between these two numbers. For example, in certain embodiments the set 
includes at least 3, 4, 5, 10, 15, 20, 30, 40 or more nucleic acid molecules of the nucleic acid 
10 molecules set forth as SEQ ID NOs: 1-50. 

In other embodiments, the method further includes determining the expression of the 
set of nucleic acid molecules or expression products thereof in a non-cancerous endometrial 
tissue sample, and comparing the expression of the set of nucleic acid molecules or 
expression products thereof in the endometrial tissue sample suspected of being cancerous 
15 and the non-cancerous endometrial tissue sample. 

The invention in another aspect provides solid-phase nucleic acid molecule arrays. 
The arrays have a cancer gene marker set that consists essentially of at least two and as many 
as all of the nucleic acid molecules set forth as SEQ ID NOs: 1-50 fixed to a solid substrate. 
The set of nucleic acid markers can include any number of nucleic acid sequences between 
20 these two numbers, selected from SEQ ID NOs: 1-50. For example, in certain embodiments 
the set includes at least 3, 4, 5, 10, 15, 20, 30, 40 or more nucleic acid molecules of the 
nucleic acid molecules set forth as SEQ ID NOs: 1-50. In some embodiments, the solid-phase 
nucleic acid molecule array also includes at least one control nucleic acid molecule. 

In certain embodiments, the solid substrate includes a material selected from the 
25 group consisting of glass, silica, aluminosilicates, borosilicates, metal oxides such as alumina 
and nickel oxide, various clays, nitrocellulose, or nylon. Preferably the substrate is glass. 

In other embodiments, the nucleic acid molecules are fixed to the solid substrate by 
covalent bonding. 

According to yet another aspect of the invention, protein microarrays are provided. 
30 The protein microarrays include antibodies or antigen-binding fragments thereof, that 

specifically bind at least two different polypeptides selected from the group consisting of 
SEQ ID NOs:5 1-100, fixed to a solid substrate. In some embodiments, the microarray 



comprises antibodies or antigen -binding fragments thereof, that bind specifically to least 3, 4, 
5, 6, 7, 8, 9, 10, 1 1, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 
31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49 or 50 different 
polypeptides selected from the group consisting of SEQ ID NOs:5 1-100. In certain 
5 embodiments, the microarray also includes an antibody or antigen-binding fragment thereof, 
that binds specifically to a cancer-associated polypeptide other than those selected from the 
group consisting of SEQ ID NOs:51-100, preferably An endometrial cancer associated 
polypeptide. In some embodiments, the protein microarray also includes at least one control 
polypeptide molecule. In further embodiments, the antibodies are monoclonal or polyclonal 

10 antibodies. In other embodiments, the antibodies are chimeric, human, or humanized 

antibodies. In some embodiments, the antibodies are single chain antibodies. In still other 
embodiments, the antigen-binding fragments are F(ab')2, Fab, Fd, or Fv fragments. 

According to yet another aspect of the invention, protein microarrays are provided. 
The protein microarrays include antibodies or antigen-binding fragments thereof, that 

15 specifically bind at least two different polypeptides selected from the group consisting of 
SEQ ID NOs:5 1-100, fixed to a solid substrate. In some embodiments, the microarray 
comprises antibodies or antigen -binding fragments thereof, that bind specifically to least 3, 4, 
5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 
31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49 or 50 different 

20 polypeptides selected from the group consisting of SEQ ID NOs:5 1-1 00. In certain 

embodiments, the microarray also includes an antibody or antigen-binding fragment thereof, 
that binds specifically to a cancer-associated polypeptide other than those selected from the 
group consisting of SEQ ID NOs:51-100, preferably an endometrial cancer associated 
polypeptide. In some embodiments, the protein microarray also includes at least one control 

25 polypeptide molecule. In further embodiments, the antibodies are monoclonal or polyclonal 
antibodies. In other embodiments, the antibodies are chimeric, human, or humanized 
antibodies. In some embodiments, the antibodies are single chain antibodies. In still other 
embodiments, the antigen-binding fragments are F(ab') 2 , Fab, Fd, or Fv fragments. 

In a further aspect of the invention, methods for identifying lead compounds for a 

30 pharmacological agent useful in the treatment of endometrial cancer are provided. The 
methods include contacting an endometrial cancer cell or tissue with a candidate 
pharmacological agent, and determining the expression of a set of nucleic acid molecules in 



the endometrial cancer cell or tissue sample under conditions which, in the absence of the 
candidate pharmacological agent, permit a first amount of expression of the set of nucleic 
acid molecules. The set of nucleic acid molecules includes at least two and as many as all of 
the nucleic acid molecules set forth as SEQ ID NOs:l-50. The methods also include 
5 detecting a test amount of the expression of the set of nucleic acid molecules, wherein a 
decrease in the test amount of expression in the presence of the candidate pharmacological 
agent relative to the first amount of expression indicates that the candidate pharmacological 
agent is a lead compound for a pharmacological agent that is useful in the treatment of 
endometrial cancer. 

10 In some embodiments of any of the foregoing methods and products, the differences 

in the expression of the nucleic acid molecules are determined by nucleic acid hybridization 
or nucleic acid amplification methods. Preferably the nucleic acid hybridization is performed 
using a solid-phase nucleic acid molecule array. In other embodiments, the differences in the 
expression of the nucleic acid molecules are determined by protein expression analysis, 

15 preferably SELDI mass spectroscopy. 

These and other aspects of the invention will be described in greater detail below. 



Detailed Description of the Invention 

The invention described herein relates to the identification of a set of genes expressed 
20 in endometrial cancer tissue that are predictive of the clinical outcome of the cancer. 
Changes in cell phenotype in cancer are often the result of one or more changes in the 
genome expression of the cell. Some genes are expressed in tumor cells, and not in normal 
cells. In addition, different genes are expressed in different subgroups of endometrial 
cancers, which have different prognoses and require different treatment regimens to optimize 
25 patient outcome. The differential expression of endometrial cancer genes can be examined 
by the assessment of nucleic acid or protein expression in the endometrial cancer tissue. 

The genes identified permit, inter alia, rapid screening of cancer samples by nucleic 
acid microarray hybridization or protein expression technology to determine the expression 
of the specific genes and thereby to predict the outcome of the cancer. Such screening is 
30 beneficial, for example, in selecting the course of treatment to provide to the cancer patient, 
and to monitor the efficacy of a treatment. 
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The invention differs from traditional endometrial cancer diagnostic and classification 
techniques with respect to the speed, simplicity, and reproducibility of the cancer diagnostic 
assay. The invention also presents targets for drug development because it identifies genes 
that are differentially expressed in outcome endometrial tumors, which can be utilized in the 
5 development of drugs to treat such tumors, e.g., by reducing expression of the genes or 
reducing activity of proteins encoded by the genes. 

The invention simplifies prognosis determination by providing an identified set of 
genes whose expression in endometrial cancers predicts clinical outcome as defined by tumor 
metastasis, recurrence, or death. In the invention RNA expression phenotyping was 

10 performed using high density microarrays generated from quantitative expression data on 
over 5000 (estimated 5800) genes, which have been analyzed to identify 50 specific probe 
sets (genes). The expression gene set has multifold uses including, but not limited to, the 
following examples. The expression gene set may be used as a prognostic tool for 
endometrial cancer patients, to make possible more finely tuned diagnosis of endometrial 

15 cancer and allow healthcare professionals to tailor treatment to individual patients' needs. 
The invention can also assess the efficacy of endometrial cancer treatment by determining 
progression or regression of endometrial cancer in patients before, during, and after 
endometrial cancer treatment. Another utility of the expression gene set is in the 
biotechnology and pharmaceutical industries' research on disease pathway discovery for 

20 therapeutic targeting. The invention can identify alterations in gene expression in 

endometrial cancer and can also be used to uncover and test candidate pharmaceutical agents 
to treat endometrial cancer. 

As used herein, a subject is a human, non-human primate, cow, horse, pig, sheep, 
goat, dog, cat, or rodent. In all embodiments human subjects are preferred. Preferably the 

25 subject is a human either suspected of having endometrial cancer, or having been diagnosed 
with endometrial cancer. In a preferred embodiment of the invention the cancer is 
endometroid endometrial adenocarcinoma. Methods for identifying subjects suspected of 
having endometrial cancer may include physical examination, subject's family medical 
history, subject's medical history, endometrial biopsy, or a number of imaging technologies 

30 such as ultrasonography, computed tomography, magnetic resonance imaging, magnetic 
resonance spectroscopy, or positron emission tomography. Diagnostic methods for 



endometrial cancer and the clinical delineation of endometrial cancer diagnoses are well 
known to those of skill in the medical arts. 

As used herein, endometrial tissue sample is tissue obtained from an endometrial 
tissue biopsy using methods well known to those of ordinary skill in the related medical arts. 

5 The phrase "suspected of being cancerous" as used herein means an endometrial cancer tissue 
sample believed by one of ordinary skill in the medical arts to contain cancerous cells. 
Methods for obtaining the sample from the biopsy include gross apportioning of a mass, 
microdissection, laser-based microdissection, cytologic sampling of the endometrium using a 
brush, aspiration curettage, fractional dilation and curettage, or other art-known cell- 

10 separation methods. 

Because of the variability of the cell types in diseased-tissue biopsy material, and the 
variability in sensitivity of the diagnostic methods used, the sample size required for analysis 
may range from 1, 10, 50, 100, 200, 300, 500, 1000, 5000, 10,000, to 50,000 or more cells. 
The appropriate sample size may be determined based on the cellular composition and 

15 condition of the biopsy and the standard preparative steps for this determination and 

subsequent isolation of the nucleic acid for use in the invention are well known to one of 
ordinary skill in the art. An example of this, although not intended to be limiting, is that in 
some instances a sample from the biopsy may be sufficient for assessment of RNA 
expression without amplification, but in other instances the lack of suitable cells in a small 

20 biopsy region may require use of RNA conversion and/or amplification methods or other 

methods to enhance resolution of the nucleic acid molecules. Such methods, which allow use 
of limited biopsy materials, are well known to those of ordinary skill in the art and include, 
but are not limited to: direct RNA amplification, reverse transcription of RNA to cDNA, 
amplification of cDNA, or the generation of radio-labeled nucleic acids. 

25 As used herein, the phrase "determining the expression of a set of nucleic acid 

molecules in the endometrial tissue" means identifying RNA transcripts in the tissue sample 
by analysis of nucleic acid or protein expression in the tissue sample. As used herein, "set" 
refers to a group of nucleic acid molecules that include 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 
15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 

30 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 different nucleic acid sequences from the group of 
nucleic acid sequences numbered 1 through 50 in Table 1 (SEQ ID NOs: 1-50). 
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The expression of the set of nucleic acid molecules in the sample from the 
endometrial cancer patient can be compared to the expression of the set of nucleic acid 
molecules in a sample of endometrial tissue that is non-cancerous. As used herein, non- 
cancerous endometrial tissue means tissue determined by one of ordinary skill in the medical 

5 art to have no evidence of endometrial cancer based on standard diagnostic methods 
including, but not limited to, histologic staining and microscopic analysis. 

Nucleic acid markers for cancer are nucleic acid molecules that by their presence or 
absence indicate the presence of absence of endometrial cancer. In tissue, certain nucleic 
acid molecules are expressed at different levels depending on whether tissue is non-cancerous 

10 or cancerous. 

Hybridization methods for nucleic acids are well known to those of ordinary skill in 
the art (see, e.g. Molecular Cloning: A Laboratory Manual, J. Sambrook, et al., eds., Second 
Edition, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York, 1989, or 
Current Protocols in Molecular Biology, F.M. Ausubel, et al., eds., John Wiley & Sons, 

15 Inc., New York). The nucleic acid molecules from an endometrial cancer tissue sample 

hybridize under stringent conditions to nucleic acid markers expressed in endometrial cancer. 
In one embodiment the markers are sets of two or more of the nucleic acid molecules as set 
forth in SEQ ID NOs: 1 through 50. 

The endometrial cancer nucleic acid markers disclosed herein are known genes and 

20 fragments thereof. It may be desirable to identify variants of those genes, such as allelic 
variants or single nucleotide polymorphisms (SNPs) in tissues. Accordingly, methods for 
identifying endometrial cancer nucleic acid markers, including variants of the disclosed full- 
length cDNAs, genomic DNAs, and SNPs are also included in the invention. The methods 
include contacting a nucleic acid sample (such as a cDNA library, genomic library, genomic 

25 DNA isolate, etc.) with a nucleic acid probe or primer derived from one of SEQ ID NOs: 1- 
50. The nucleic acid sample and the probe or primer hybridize to complementary nucleotide 
sequences of nucleic acids in the sample, if any are present, allowing detection of nucleic 
acids related to SEQ ID NOs: 1-50. Preferably the probe or primer is detectably labeled. The 
specific conditions, reagents, and the like can be selected by one of ordinary skill in the art to 

30 selectively identify nucleic acids related to sets of two or more of SEQ ID NOs: 1 through 50. 
The isolated nucleic acid molecule can be sequenced according to standard procedures. 
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In addition to native nucleic acid markers (SEQ ID NOs:l-50), the invention also 
includes degenerate nucleic acids that include alternative codons to those present in the native 
materials. For example, serine residues are encoded by the codons TCA, AGT, TCC, TCG, 
TCT, and AGC. Each of the six codons is equivalent for the purposes of encoding a serine 
5 residue. Similarly, nucleotide sequence triplets that encode other amino acid residues 
include, but are not limited to: CCA, CCC, CCG, and CCT (proline codons); CGA, CGC, 
CGG, CGT, AGA, and AGG (arginine codons); ACA, ACC, ACG, and ACT (threonine 
codons); AAC and AAT (asparagine codons); and ATA, ATC, and ATT (isoleucine codons). 
Other amino acid residues may be encoded similarly by multiple nucleotide sequences. Thus, 

10 the invention embraces degenerate nucleic acids that differ from the biologically isolated 
nucleic acids in codon sequence due to the degeneracy of the genetic code. 

The invention also provides modified nucleic acid molecules, which include 
additions, substitutions, and deletions of one or more nucleotides such as the allelic variants 
and SNPs described above. In preferred embodiments, these modified nucleic acid molecules 

15 and/or the polypeptides they encode retain at least one activity or function of the unmodified 
nucleic acid molecule and/or the polypeptides, such as hybridization, antibody binding, etc. 
In certain embodiments, the modified nucleic acid molecules encode modified polypeptides, 
preferably polypeptides having conservative amino acid substitutions. As used herein, a 
"conservative amino acid substitution" refers to an amino acid substitution which does not 

20 alter the relative charge or size characteristics of the protein in which the amino acid 

substitution is made. Conservative substitutions of amino acids include substitutions made 
amongst amino acids within the following groups: (a) M, I, L, V; (b) F, Y, W; (c) K, R, H; 
(d) A, G; (e) S, T; (f) Q, N; and (g) E, D. The modified nucleic acid molecules are 
structurally related to the unmodified nucleic acid molecules and in preferred embodiments 

25 are sufficiently structurally related to the unmodified nucleic acid molecules so that the 

modified and unmodified nucleic acid molecules hybridize under stringent conditions known 
to one of skill in the art. 

For example, modified nucleic acid molecules that encode polypeptides having single 
amino acid changes can be prepared for use in the methods and products disclosed herein. 

30 Each of these nucleic acid molecules can have one, two, or three nucleotide substitutions 
exclusive of nucleotide changes corresponding to the degeneracy of the genetic code as 
described herein. Likewise, modified nucleic acid molecules that encode polypeptides 
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having two amino acid changes can be prepared, which have, e.g., 2-6 nucleotide changes. 
Numerous modified nucleic acid molecules like these will be readily envisioned by one of 
skill in the art, including for example, substitutions of nucleotides in codons encoding amino 
acids 2 and 3, 2 and 4, 2 and 5, 2 and 6, and so on. In the foregoing example, each 
5 combination of two amino acids is included in the set of modified nucleic acid molecules, as 
well as all nucleotide substitutions that code for the amino acid substitutions. Additional 
nucleic acid molecules that encode polypeptides having additional substitutions (i.e., 3 or 
more), additions or deletions [e.g., by introduction of a stop codon or a splice site(s)] also can 
be prepared and are embraced by the invention as readily envisioned by one of ordinary skill 
10 in the art. Any of the foregoing nucleic acids can be tested by routine experimentation for 
retention of structural relation to or activity similar to the nucleic acids disclosed herein. 

In the invention, standard hybridization techniques of microarray technology are 
utilized to assess patterns of nucleic acid expression and identify nucleic acid marker 
expression. Microarray technology, which is also known by other names including: DNA 
15 chip technology, gene chip technology, and solid-phase nucleic acid array technology, is well 
known to those of ordinary skill in the art and is based on, but not limited to, obtaining an 
array of identified nucleic acid probes on a fixed substrate, labeling target molecules with 
reporter molecules (e.g., radioactive, chemiluminescent, or fluorescent tags such as 
fluorescein, Cye3-dUTP, or Cye5-dUTP), hybridizing target nucleic acids to the probes, and 
20 evaluating target-probe hybridization. A probe with a nucleic acid sequence that perfectly 
matches the target sequence will, in general, result in detection of a stronger reporter- 
molecule signal than will probes with less perfect matches. Many components and 
techniques utilized in nucleic acid microarray technology are presented in The Chipping 
Forecast, Nature Genetics, Vol.21, Jan 1999, the entire contents of which is incorporated by 
25 reference herein. 

According to the present invention, microarray substrates may include but are not 
limited to glass, silica, alumino silicates, borosilicates, metal oxides such as alumina and 
nickel oxide, various clays, nitrocellulose, or nylon. In all embodiments a glass substrate is 
preferred. According to the invention, probes are selected from the group of nucleic acids 
30 including, but not limited to: DNA, genomic DNA, cDNA, and oligonucleotides; and may be 
natural or synthetic. Oligonucleotide probes preferably are 20 to 25-mer oligonucleotides 
and DNA/cDNA probes preferably are 500 to 5000 bases in length, although other lengths 
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may be used. Appropriate probe length may be determined by one of ordinary skill in the art 
by following art-known procedures. In one embodiment, preferred probes are sets of two or 
more of the nucleic acid molecules set forth as SEQ ID NO: 1 through 50 (see also Table 1). 
Probes may be purified to remove contaminants using standard methods known to those of 
5 ordinary skill in the art such as gel filtration or precipitation. 

In one embodiment, the microarray substrate may be coated with a compound to 
enhance synthesis of the probe on the substrate. Such compounds include, but are not limited 
to, oligoethylene glycols. In another embodiment, coupling agents or groups on the substrate 
can be used to covalently link the first nucleotide or olignucleotide to the substrate. These 

10 agents or groups may include, but are not limited to: amino, hydroxy, bromo, and carboxy 
groups. These reactive groups are preferably attached to the substrate through a hydrocarbyl 
radical such as an alkylene or phenylene divalent radical, one valence position occupied by 
the chain bonding and the remaining attached to the reactive groups. These hydrocarbyl 
groups may contain up to about ten carbon atoms, preferably up to about six carbon atoms. 

15 Alkylene radicals are usually preferred containing two to four carbon atoms in the principal 
chain. These and additional details of the process are disclosed, for example, in U.S. Patent 
4,458,066, which is incorporated by reference in its entirety. 

In one embodiment, probes are synthesized directly on the substrate in a 
predetermined grid pattern using methods such as light-directed chemical synthesis, 

20 photochemical deprotection, or delivery of nucleotide precursors to the substrate and 
subsequent probe production. 

In another embodiment, the substrate may be coated with a compound to enhance 
binding of the probe to the substrate. Such compounds include, but are not limited to: 
polylysine, amino silanes, amino-reactive silanes (Chipping Forecast, 1999) or chromium 

25 (Gwynne and Page, 2000). In this embodiment, presynthesized probes are applied to the 

substrate in a precise, predetermined volume and grid pattern, utilizing a computer-controlled 
robot to apply probe to the substrate in a contact-printing manner or in a non-contact manner 
such as ink jet or piezo-electric delivery. Probes may be covalently linked to the substrate 
with methods that include, but are not limited to, UV-irradiation. In another embodiment 

30 probes are linked to the substrate with heat. 

Targets are nucleic acids selected from the group, including but not limited to: DNA, 
genomic DNA, cDNA, RNA, mRNA and may be natural or synthetic. In all embodiments, 
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nucleic acid molecules from human endometrial tissue are preferred. The tissue may be 
obtained from a subject or may be grown in culture (e.g. from a endometrial cancer cell line). 

In embodiments of the invention one or more control nucleic acid molecules are 
attached to the substrate. Preferably, control nucleic acid molecules allow determination of 
5 factors including but not limited to: nucleic acid quality and binding characteristics; reagent 
quality and effectiveness; hybridization success; and analysis thresholds and success. Control 
nucleic acids may include but are not limited to expression products of genes such as 
housekeeping genes or fragments thereof. 

To select a set of tumor markers, the expression data generated by, for example, 

10 microarray analysis of gene expression, preferably is analyzed to determine which genes in 
different groups of cancer tissues are significantly differentially expressed. In the methods 
disclosed herein, the significance of gene expression was determined using Permax computer 
software, although any standard statistical package that can discriminate significant 
differences in expression may be used. Permax performs permutation 2-sample t-tests on 

15 large arrays of data. For high dimensional vectors of observations, the Permax software 
computes t-statistics for each attribute, and assesses significance using the permutation 
distribution of the maximum and minimum overall attributes. 

In one embodiment of the invention, expression of nucleic acid markers is used to 
select clinical treatment paradigms for endometrial cancer. Treatment options, as described 

20 herein, may include but are not limited to: radiotherapy, chemotherapy, adjuvant therapy, or 
any combination of the aforementioned methods. Aspects of treatment that may vary 
include, but are not limited to: dosages, timing of administration, or duration or therapy; and 
may or may not be combined with other treatments, which may also vary in dosage, timing, 
or duration. Another treatment for endometrial cancer is surgery, which can be utilized either 

25 alone or in combination with any of the aforementioned treatment methods. One of ordinary 
skill in the medical arts may determine an appropriate treatment paradigm based on 
evaluation of differential expression of sets of two or more of the nucleic acid targets ste forth 
as SEQ ID NOs:l-50. Cancers that express markers that are indicative of a more aggressive 
cancer or poor prognosis may be treated with more aggressive therapies. 

30 Progression or regression of endometrial cancer is determined by comparison of two 

or more different endometrial cancer tissue samples taken at two or more different times from 
a subject. For example, progression or regression may be evaluated by assessments of 
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expression of sets of two or more of the nucleic acid targets, including but not limited to SEQ 
ID NOs:l-50, in an endometrial cancer tissue sample from a subject before, during, and 
following treatment for endometrial cancer. 

In another embodiment, novel pharmacological agents useful in the treatment of 
5 endometrial cancer can be identified by assessing variations in the expression of sets of two 
or more endometrial cancer nucleic acid markers, from among SEQ ID NOs:l-50, prior to 
and after contacting endometrial cancer cells or tissues with candidate pharmacological 
agents for the treatment of endometrial cancer. The cells may be grown in culture (e.g. from 
an endometrial cancer cell line), or may be obtained from a subject, (e.g. in a clinical trial of 

10 candidate pharmaceutical agents to treat endometrial cancer). Alterations in expression of 
two or more sets of endometrial cancer nucleic acid markers, from among SEQ ID NOs:l-50, 
in endometrial cancer cells or tissues tested before and after contact with a candidate 
pharmacological agent to treat endometrial cancer, indicate progression, regression, or stasis 
of the endometrial cancer thereby indicating efficacy of candidate agents and concomitant 
" 15 identification of lead compounds for therapeutic use in endometrial cancer. 

The invention further provides efficient methods of identifying pharmacological 
agents or lead compounds for agents active at the level of endometrial cancer cellular 
function. Generally, the screening methods involve assaying for compounds that beneficially 
alter endometrial cancer nucleic acid molecule expression. Such methods are adaptable to 

20 automated, high-throughput screening of compounds. 

The assay mixture comprises a candidate pharmacological agent. Typically, a 
plurality of assay mixtures are run in parallel with different agent concentrations to obtain a 
different response to the various concentrations. Typically, one of these concentrations 
serves as a negative control, i.e., at zero concentration of agent or at a concentration of agent 

25 below the limits of assay detection. Candidate agents encompass numerous chemical classes, 
although typically they are organic compounds. Preferably, the candidate pharmacological 
agents are small organic compounds, i.e., those having a molecular weight of more than 50 
yet less than about 2500, preferably less than about 1000 and, more preferably, less than 
about 500. Candidate agents comprise functional chemical groups necessary for structural 

30 interactions with polypeptides and/or nucleic acids, and typically include at least an amine, 
carbonyl, hydroxyl, or carboxyl group, preferably at least two of the functional chemical 
groups and more preferably at least three of the functional chemical groups. The candidate 
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agents can comprise cyclic carbon or heterocyclic structure and/or aromatic or polyaromatic 
structures substituted with one or more of the above-identified functional groups. Candidate 
agents also can be biomolecules such as peptides, saccharides, fatty acids, sterols, 
isoprenoids, purines, pyrimidines, derivatives or structural analogs of the above, or 
5 combinations thereof and the like. Where the agent is a nucleic acid, the agent typically is a 
DNA or RNA molecule, although modified nucleic acids as defined herein are also 
contemplated. 

Candidate agents are obtained from a wide variety of sources including libraries of 
synthetic or natural compounds. For example, numerous means are available for random and 

10 directed synthesis of a wide variety of organic compounds and biomolecules, including 

expression of randomized oligonucleotides, synthetic organic combinatorial libraries, phage 
display libraries of random peptides, and the like. Alternatively, libraries of natural 
compounds in the form of bacterial, fungal, plant, and animal extracts are available or readily 
produced. Additionally, natural and synthetically produced libraries and compounds can be 

15 readily be modified through conventional chemical, physical, and biochemical means. 

Further, known pharmacological agents may be subjected to directed or random chemical 
modifications such as acylation, alkylation, esterification, amidification, etc. to produce 
structural analogs of the agents. 

A variety of other reagents also can be included in the mixture. These include 

20 reagents such as salts, buffers, neutral proteins (e.g., albumin), detergents, etc. which may be 
used to facilitate optimal protein-protein and/or protein-nucleic acid binding. Such a reagent 
may also reduce non-specific or background interactions of the reaction components. Other 
reagents that improve the efficiency of the assay such as protease, inhibitors, nuclease 
inhibitors, antimicrobial agents, and the like may also be used. 

25 The mixture of the foregoing assay materials is incubated under conditions whereby, 

the anti-endometrial cancer candidate agent specifically binds the cellular binding target, a 
portion thereof or analog thereof. The order of addition of components, incubation 
temperature, time of incubation, and other paameters of the assay may be readily determined. 
Such experimentation merely involves optimization of the assay parameters, not the 

30 fundamental composition of the assay. Incubation temperatures typically are between 4°C 
and 40°C. Incubation times preferably are minimized to facilitate rapid, high throughput 
screening, and typically are between 0.1 and 10 hours. 
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After incubation, the presence or absence of specific binding between the anti- 
endometrial cancer candidate agent and one or more binding targets is detected by any 
convenient method available to the user. For cell-free binding type assays, a separation step 
is often used to separate bound from unbound components. The separation step may be 

5 accomplished in a variety of ways. Conveniently, at least one of the components is 
immobilized on a solid substrate, from which the unbound components may be easily 
separated. The solid substrate can be made of a wide variety of materials and in a wide 
variety of shapes, e.g., microtiter plate, microbead, dipstick, resin particle, etc. The substrate 
preferably is chosen to maximize signal-to-noise ratios, primarily to minimize background 

10 binding, as well as for ease of separation and cost. 

Separation may be effected for example, by removing a bead or dipstick from a 
reservoir, emptying or diluting a reservoir such as a microtiter plate well, rinsing a bead, 
particle, chromotograpic column or filter with a wash solution or solvent. The separation step 
preferably includes multiple rinses or washes. For example, when the solid substrate is a 

15 microtiter plate, the wells may be washed several times with a washing solution, which 
typically includes those components of the incubation mixture that do not participate in 
specific bindings such as salts, buffer, detergent, non-specific protein, etc. Where the solid 
substrate is a magnetic bead, the beads may be washed one or more times with a washing 
solution and isolated using a magnet. 

20 Detection may be effected in any convenient way for cell-based assays such as two- 

or three-hybrid screens. The transcript resulting from a reporter gene transcription assay of 
the anti-cancer agent binding to a target molecule typically encodes a directly or indirectly 
detectable product, e.g., P-galactosidase activity, luciferase activity, and the like. For cell- 
free binding assays, one of the components usually comprises, or is coupled to, a detectable 

25 label. A wide variety of labels can be used, such as those that provide direct detection (e.g., 
radioactivity, luminescence, optical, or electron density, etc) or indirect detection (e.g., 
epitope tag such as the FLAG epitope, enzyme tag such as horseseradish peroxidase, etc.). 
The label may be bound to an anti-cancer agent binding partner, or incorporated into the 
structure of the binding partner. 

30 A variety of methods may be used to detect the label, depending on the nature of the 

label and other assay components. For example, the label may be detected while bound to the 
solid substrate or subsequent to separation from the solid substrate. Labels may be directly 
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detected through optical or electron density, radioactive emissions, nonradiative energy 
transfers, etc. or indirectly detected with antibody conjugates, strepavidin-biotin conjugates, 
etc. Methods for detecting the labels are well known in the art. 

The invention provides endometrial cancer gene-specific binding agents, methods of 

5 identifying and making such agents, and their use in diagnosis, therapy and pharmaceutical 
development. For example, endometrial cancer gene-specific pharmacological agents are 
useful in a variety of diagnostic and therapeutic applications as described herein. In general, 
the specificity of an endometrial cancer gene binding to a binding agent is shown by binding 
equilibrium constants. Targets that are capable of selectively binding an endometrial cancer 

10 gene preferably have binding equilibrium constants of at least about 1 0 7 M" 1 , more preferably 
at least about 10 8 M" 1 , and most preferably at least about 10 9 M" 1 . The wide variety of cell- 
based and cell- free assays may be used to demonstrate endometrial cancer gene-specific 
binding. Cell-based assays include one, two and three hybrid screens, assays in which 
endometrial cancer gene-mediated transcription is inhibited or increased, etc. Cell-free 

15 assays include endometrial cancer gene-protein binding assays, immunoassays, etc. Other 
assays useful for screening agents which bind endometrial cancer polypeptides include 
fluorescence resonance energy transfer (FRET), and electrophoretic mobility shift analysis 
(EMSA). 

In another aspect of the invention, pre- and post-treatment alterations in expression of 
20 two or more sets of endometrial cancer nucleic acid markers including, but not limited to, 
SEQ ID NOs:l-50 in endometrial cancer cells or tissues may be used to assess treatment 
parameters including, but not limited to: dosage, method of administration, timing of 
administration, and combination with other treatments as described herein. 

Candidate pharmacological agents may include antisense oligonucleotides that 
25 selectively bind to an endometrial cancer nucleic acid marker molecule, as identified herein, 
to reduce the expression of the marker molecules in endometrial cancer cells and tissues. 
One of ordinary skill in the art can test of the effects of a reduction of expression of 
endometrial cancer nucleic acid marker sequences in vivo or in vitro, to determine the 
efficacy of one or more antisense oligonucleotides. 
30 As used herein, the term "antisense oligonucleotide" or "antisense" describes an 

oligonucleotide that is an oligoribonucleotide, oligodeoxyribonucleotide, modified 
oligoribonucleotide, or modified oligodeoxyribonucleotide, which hybridizes under 
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physiological conditions to DNA comprising a particular gene or to an mRNA transcript of 
that gene and, thereby, inhibits the transcription of that gene and/or the translation of that 
mRNA. The antisense molecules are designed so as to interfere with transcription or 
translation of a target gene upon hybridization with the target gene or transcript. Those 

5 skilled in the art will recognize that the exact length of the antisense oligonucleotide and its 
degree of complementarity with its target will depend upon the specific target selected, 
including the sequence of the target and the particular bases which comprise that sequence. It 
is preferred that the antisense oligonucleotide be constructed and arranged so as to bind 
selectively with the target under physiological conditions, i.e., to hybridize substantially more 

10 to the target sequence than to any other sequence in the target cell under physiological 
conditions. 

Based upon the sequences of endometrial cancer expressed nucleic acids, or upon 
allelic or homologous genomic and/or cDNA sequences, one of skill in the art can easily 
choose and synthesize any of a number of appropriate antisense molecules for use in 

15 accordance with the present invention. In order to be sufficiently selective and potent for 
inhibition, such antisense oligonucleotides should comprise at least 10 and, more preferably, 
at least 15 consecutive bases that are complementary to the target, although in certain cases 
modified oligonucleotides as short as 7 bases in length have been used successfully as 
antisense oligonucleotides (Wagner et al., 1996). Most preferably, the antisense 

20 oligonucleotides comprise a complementary sequence of 20-30 bases. Although 

oligonucleotides may be chosen that are antisense to any region of the gene or mRNA 
transcripts, in preferred embodiments the antisense oligonucleotides correspond to N-terminal 
or 5' upstream sites such as translation initiation, transcription initiation, or promoter sites. In 
addition, 3 '-untranslated regions may be targeted. Targeting to mRNA splicing sites has also 

25 been used in the art but may be less preferred if alternative mRNA splicing occurs. In 

addition, the antisense is targeted, preferably, to sites in which mRNA secondary structure is 
not expected (see, e.g., Sainio et al., 1994) and at which proteins are not expected to bind. 
Finally, although the listed sequences are cDNA sequences, one of ordinary skill in the art 
may easily derive the genomic DNA corresponding to the cDNA of an endometrial cancer 

30 expressed polypeptide. Thus, the present invention also provides for antisense 

oligonucleotides that are complementary to the genomic DNA corresponding to endometrial 
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cancer expressed nucleic acids. Similarly, the use of antisense to allelic or homologous 
cDNAs and genomic DNAs are enabled without undue experimentation. 

In one set of embodiments, the antisense oligonucleotides of the invention may be 
composed of "natural" deoxyribonucleotides, ribonucleotides, or any combination thereof. 
5 That is, the 5' end of one native nucleotide and the 3' end of another native nucleotide may be 
covalently linked, as in natural systems, via a phosphodiester internucleoside linkage. These 
oligonucleotides may be prepared by art-recognized methods, which may be carried out 
manually or by an automated synthesizer. They also may be produced recombinantly by 
vectors. 

10 In preferred embodiments, however, the antisense oligonucleotides of the invention 

also may include "modified" oligonucleotides. That is, the oligonucleotides may be modified 
in a number of ways that do not prevent them from hybridizing to their target but which 
enhance their stability or targeting or which otherwise enhance their therapeutic 
effectiveness. The term "modified oligonucleotide" as used herein describes an 

15 oligonucleotide in which (1) at least two of its nucleotides are covalently linked via a 

synthetic internucleoside linkage (i.e., a linkage other than a phosphodiester linkage between 
the 5' end of one nucleotide and the 3' end of another nucleotide) and/or (2) a chemical group 
not normally associated with nucleic acids has been covalently attached to the 
oligonucleotide. Preferred synthetic internucleoside linkages are phosphorothioates, 

20 alkylphosphonates, phosphorodithioates, phosphate esters, alkylphosphonothioates, 

phosphoramidates, carbamates, carbonates, phosphate triesters, acetamidates, carboxymethyl 
esters, and peptides. 

The term "modified oligonucleotide" also encompasses oligonucleotides with a 
covalently modified base and/or sugar. For example, modified oligonucleotides include 

25 oligonucleotides having backbone sugars that are covalently attached to low molecular 
weight organic groups other than a hydroxyl group at the 3' position and other than a 
phosphate group at the 5' position. Thus modified oligonucleotides may include a 2-0- 
alkylated ribose group. In addition, modified oligonucleotides may include sugars such as 
arabinose instead of ribose. The present invention, thus, contemplates pharmaceutical 

30 preparations containing modified antisense molecules that are complementary to and 

hybridizable with, under physiological conditions, endometrial cancer expressed nucleic 
acids, together with pharmaceutically acceptable carriers. 
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Antisense oligonucleotides may be administered as part of a pharmaceutical 
composition. Such a pharmaceutical composition may include the antisense oligonucleotides 
in combination with any standard physiologically and/or pharmaceutically acceptable carriers 
which are known in the art. The compositions should be sterile and contain a therapeutically 

5 effective amount of the antisense oligonucleotides in a unit of weight or volume suitable for 
administration to a patient. The term "pharmaceutically acceptable" means a non-toxic 
material that does not interfere with the effectiveness of the biological activity of the active 
ingredients. The term "physiologically acceptable" refers to a non-toxic material that is 
compatible with a biological system such as a cell, cell culture, tissue, or organism. The 

10 characteristics of the carrier will depend on the route of administration. Physiologically and 
pharmaceutically acceptable carriers include diluents, fillers, salts, buffers, stabilizers, 
solubilizers, and other materials, which are well known in the art. 

Expression of endometrial cancer nucleic acid molecules can also be determined 
using protein measurement methods to determine expression of SEQ ID NOs:l-50, e.g., by 

15 determining the expression of polypeptides encoded by SEQ ID NOs: 1 -50 (SEQ ID NOs: 5 1 - 
100). Preferred methods of specifically and quantitatively measuring proteins include, but 
are not limited to: mass spectroscopy-based methods such as surface enhanced laser 
desorption ionization (SELDI; e.g., Ciphergen ProteinChip System), non-mass spectroscopy- 
based methods, and immunohistochemistry-based methods such as 2-dimensional gel 

20 electrophoresis. 

SELDI methodology may, through procedures known to those of ordinary skill in the 
art, be used to vaporize microscopic amounts of tumor protein and to create a "fingerprint" 
of individual proteins, thereby allowing simultaneous measurement of the abundance of many 
proteins in a single sample. Preferably SELDI-based assays may be utilized to classify 

25 endometrial cancer tumors. Such assays preferably include, but are not limited to the 

following examples. Gene products discovered by RNA microarrays may be selectively 
measured by specific (antibody mediated) capture to the SELDI protein disc (e.g., selective 
SELDI). Gene products discovered by protein screening (e.g., with 2-D gels), may be 
resolved by "total protein SELDI" optimized to visualize those particular markers of interest 

30 from among SEQ ID NOs: 1-50. Predictive models of tumor classification from SELDI 
measurement of multiple markers from among SEQ ID NOs: 1-50 may be utilized for the 
SELDI strategies. In an additional embodiment a set of endometrioid endometrial 
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adenocarcinoma tissues may be preferably utilized to determine the risk classification of 
endometrial cancer based on SELDI results. 

The invention also involves agents such as polypeptides that bind to endometrial 
cancer- associated polypeptides, i.e., SEQ ID NOs:5 1-100. Such binding agents can be used, 
for example, in screening assays to detect the presence or absence of endometrial cancer- 
associated polypeptides and complexes of endometrial cancer-associated polypeptides and 
their binding partners and in purification protocols to isolate endometrial cancer-associated 
polypeptides and complexes of endometrial cancer-associated polypeptides and their binding 
partners. Such agents also may be used to inhibit the native activity of the endometrial 
cancer-associated polypeptides, for example, by binding to such polypeptides. 

The invention, therefore, embraces peptide binding agents which, for example, can be 
antibodies or fragments of antibodies having the ability to selectively bind to endometrial 
cancer-associated polypeptides. Antibodies include polyclonal and monoclonal antibodies, 
prepared according to conventional methodology. 

Significantly, as is well-known in the art, only a small portion of an antibody 
molecule, the paratope, is involved in the binding of the antibody to its epitope (see, in 
general, Clark, W.R. (1986) The Experimental Foundations of Modern Immunology Wiley & 
Sons, Inc., New York; Roitt, I. (1991) Essential Immunology , 7th Ed., Blackwell Scientific 
Publications, Oxford). The pFc' and Fc regions, for example, are effectors of the complement 
cascade but are not involved in antigen binding. An antibody from which the pFc' region has 
been enzymatically cleaved, or which has been produced without the pFc' region, designated 
an F(ab') 2 fragment, retains both of the antigen binding sites of an intact antibody. Similarly, 
an antibody from which the Fc region has been enzymatically cleaved, or which has been 
produced without the Fc region, designated an Fab fragment, retains one of the antigen 
binding sites of an intact antibody molecule. Proceeding further, Fab fragments consist of a 
covalently bound antibody light chain and a portion of the antibody heavy chain denoted Fd. 
The Fd fragments are the major determinant of antibody specificity (a single Fd fragment 
may be associated with up to ten different light chains without altering antibody specificity) 
and Fd fragments retain epitope-binding ability in isolation. 

Within the antigen-binding portion of an antibody, as is well-known in the art, there 
are complementarity determining regions (CDRs), which directly interact with the epitope of 
the antigen, and framework regions (FRs), which maintain the tertiary structure of the 
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paratope (see, in general, Clark, 1986; Roitt, 1991). In both the heavy chain Fd fragment and 
the light chain of IgG immunoglobulins, there are four framework regions (FR1 through FR4) 
separated respectively by three complementarity determining regions (CDR1 through CDR3). 
The CDRs, and in particular the CDR3 regions, and more particularly the heavy chain CDR3, 

5 are largely responsible for antibody specificity. 

It is now well-established in the art that the non-CDR regions of a mammalian 
antibody may be replaced with similar regions of conspecific or heterospecific antibodies 
while retaining the epitopic specificity of the original antibody. This is most clearly 
manifested in the development and use of "humanized" antibodies in which non-human 

10 CDRs are covalently joined to human FR and/or Fc/pFc' regions to produce a functional 
antibody. See, e.g., U.S. patents 4,816,567, 5,225,539, 5,585,089, 5,693,762 and 5,859,205. 

Fully human monoclonal antibodies also can be prepared by immunizing mice 
transgenic for large portions of human immunoglobulin heavy and light chain loci. 
Following immunization of these mice (e.g., XenoMouse (Abgenix), HuMAb mice 

15 (Medarex/GenPharm)), monoclonal antibodies can be prepared according to standard 

hybridoma technology. These monoclonal antibodies will have human immunoglobulin 
amino acid sequences and therefore will not provoke human anti-mouse antibody (HAMA) 
responses when administered to humans. 

Thus, as will be apparent to one of ordinary skill in the art, the present invention also 

20 provides for F(ab'>2, Fab, Fv and Fd fragments; chimeric antibodies in which the Fc and/or 
FR and/or CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by 
homologous human or non-human sequences; chimeric F(ab'>2 fragment antibodies in which 
the FR and/or CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by 
homologous human or non-human sequences; chimeric Fab fragment antibodies in which the 

25 FR and/or CDR1 and/or CDR2 and/or light chain CDR3 regions have been replaced by 

homologous human or non-human sequences; and chimeric Fd fragment antibodies in which 
the FR and/or CDR1 and/or CDR2 regions have been replaced by homologous human or non- 
human sequences. The present invention also includes so-called single chain antibodies. 
Thus, the invention involves polypeptides of numerous size and type that bind 

30 specifically to polypeptides selected from SEQ ID NOs:51-100, and complexes of both 

endometrial cancer-associated polypeptides and their binding partners. These polypeptides 
may be derived also from sources other than antibody technology. For example, such 
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polypeptide binding agents can be provided by degenerate peptide libraries which can be 
readily prepared in solution, in immobilized form or as phage display libraries. 
Combinatorial libraries also can be synthesized of peptides containing one or more amino 
acids. Libraries further can be synthesized of peptoids and non-peptide synthetic moieties. 

5 Phage display can be particularly effective in identifying binding peptides useful 

according to the invention. Briefly, one prepares a phage library (using e.g. ml 3, fd, or 
lambda phage), displaying inserts from 4 to about 80 amino acid residues using conventional 
procedures. The inserts may represent, for example, a completely degenerate or biased array. 
One then can select phage-bearing inserts which bind to the endometrial cancer-associated 

10 polypeptide. This process can be repeated through several cycles of reselection of phage that 
bind to the endometrial cancer-associated polypeptide. Repeated rounds lead to enrichment 
of phage bearing particular sequences. DNA sequence analysis can be conducted to identify 
the sequences of the expressed polypeptides. The minimal linear portion of the sequence that 
binds to the endometrial cancer-associated polypeptide can be determined. One can repeat 

15 the procedure using a biased library containing inserts containing part or all of the minimal 
linear portion plus one or more additional degenerate residues upstream or downstream 
thereof. Yeast two-hybrid screening methods also may be used to identify polypeptides that 
bind to the endometrial cancer-associated polypeptides. 

Thus, the endometrial cancer-associated polypeptides of the invention, including 

20 fragments thereof, can be used to screen peptide libraries, including phage display libraries, 
to identify and select peptide binding partners of the endometrial cancer-associated 
polypeptides of the invention. Such molecules can be used, as described, for screening 
assays, for purification protocols, for interfering directly with the functioning of endometrial 
cancer-associated polypeptides and for other purposes that will be apparent to those of 

25 ordinary skill in the art. For example, isolated endometrial cancer-associated polypeptides 
can be attached to a substrate (e.g., chromatographic media, such as polystyrene beads, a 
filter, or an array substrate), and then a solution suspected of containing the binding partner 
may be applied to the substrate. If a binding partner that can interact with endometrial 
cancer-associated polypeptides is present in the solution, then it will bind to the substrate- 

30 endometrial cancer-associated polypeptide. The binding partner then may be isolated. 

As detailed herein, the foregoing antibodies and other binding molecules may be used 
for example, to identify tissues expressing protein or to purify protein. Antibodies also may 
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be coupled to specific diagnostic labeling agents for imaging of cells and tissues that express 
endometrial cancer-associated polypeptides or to therapeutically useful agents according to 
standard coupling procedures. Diagnostic agents include, but are not limited to, barium 
sulfate, iocetamic acid, iopanoic acid, ipodate calcium, diatrizoate sodium, diatrizoate 
meglumine, metrizamide, tyropanoate sodium and radiodiagnostics including positron 
emitters such as fluorine- 18 and carbon-11, gamma emitters such as iodine- 123, 
technitium-99m, iodine- 1 3 1 and indium- 111, nuclides for nuclear magnetic resonance such as 
fluorine and gadolinium. 

The invention further includes protein microarrays for analyzing expression of 
endometrial cancer-associated peptides selected from SEQ ID NOs:5 1-100. In this aspect of 
the invention, standard techniques of microarray technology are utilized to assess expression 
of the endometrial cancer-associated polypeptides and/or identify biological constituents that 
bind such polypeptides. The constituents of biological samples include antibodies, 
lymphocytes (particularly T lymphocytes), and the like. Protein microarray technology, 
which is also known by other names including: protein chip technology and solid-phase 
protein array technology, is well known to those^of ordinary skill in the art and is based on, 
but not limited to, obtaining an array of identified peptides or proteins on a fixed substrate, 
binding target molecules or biological constituents to the peptides, and evaluating such 
binding. See, e.g., G. MacBeath and S.L. Schreiber, "Printing Proteins as Microarrays for 
High-Throughput Function Determination," Science 289(5485): 1760-1 763, 2000. 

Preferably antibodies or antigen binding fragments thereof that specifically bind 
polypeptides selected from the group consisting of SEQ ID NOs:5 1-100 are attached to the 
microarray substrate in accordance with standard attachment methods known in the art. 
These arrays can be used to quantify the expression of the polypeptides identified herein. 

In some embodiments of the invention, one or more control peptide or protein 
molecules are attached to the substrate. Preferably, control peptide or protein molecules 
allow determination of factors such as peptide or protein quality and binding characteristics, 
reagent quality and effectiveness, hybridization success, and analysis thresholds and success. 

The use of such methods to determine expression of endometrial cancer nucleic acids 
from among SEQ ID NOs:l-50 and/or proteins from among SEQ ID Nos:5 1-100 can be done 
with routine methods known to those of ordinary skill in the art and the expression 
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determined by protein measurement methods may be used as a prognostic method for 
selecting treatment strategies for endometrial cancer patients. 

Examples 

5 To establish a prognostic tool for designing endometrial cancer treatment regimens, 

expression patterns in primary endometrialcancer specimens were assessed and correlated 
with clinical outcome. 

Tissue processing: 

10 RNA isolated from normal cycling (proliferative, n=2; secretory, n=2) and neoplastic 

(endometrioid adenocarcinoma, n=10) human endometrial specimens was reverse transcribed 
and resultant cDNA used for in vitro transcriptional synthesis of fluorescently labeled nucleic 
acid probes according to manufacturer's instructions. Each resultant tissue-derived probe was 
then separately hybridized to an Affymetrix HuFL human expression array and hybridization 
- 15 images analyzed with Affymetrix software to generate a data matrix of named probes by 
quantitative expression level in each tissue. 

Data Normalization: 

Average differences for each sample were rescaled to sum to 3,000,000 over all 
20 genes. Then the average differences with an Affymetrix call of Absent or Marginal were set 
to 20, and average differences with a call of Present but with less than 20 were also set to 20. 
This resulted in a dataset truncated on the left tail at a value of 20, in which only genes 
determined to be "present" by the Affymetrix call were included as positive expression 
values. 

25 

Permax Test: 

Standard pooled variance t-statistics comparing the 4 normal samples to the 10 tumor 
samples were computed separately for each gene from their log values. Log values were used 
because it is natural to think of differences between tissue types as a multiplicative effect or 
30 ratio increase/decrease. Only genes with at least 2 values > 20 were considered (3665 genes), 
since the t statistic is undefined for genes with all values = 20, and the statistic is either 1 .69 
or -.62 with only one value not equal to 20, regardless of the value. 
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The permutation distribution was used to assess the significance of t-statistics 
calculated for each gene in the dataset (Permax test). The customized program written in S- 
plus language to calculate Permax is a data analysis software tool for testing the significance 
of gene expression. It has been presented by Mutter, et al., 8th International Workshop on 

5 Chromosomes in Solid Tumors, Tucson, AZ, 2000; and is available online 2 at 

biowww.dfci.harvard.edu/~gray/permax.html and from Robert J. Gray, Department of 
Biostatistical Science, Dana-Farber Cancer Institute, 44 Binney Street Boston, MA 021 15. 
Permax details enclosed therein are incorporated by reference herein.. In this approach all 
1001 possible ways of dividing the 14 samples into two groups of sizes 4 and 10 were 

10 considered. For each of these, the t-statistics were computed for each gene. With unequal 

group sizes, these distributions are not symmetric, so the significance was assessed separately 
in each direction. To control the overall error rate, the distributions of the maximum and 
minimum t-statistics over the genes were used. That is, for each gene, the p-value in the 
direction with expression higher (lower) in normals is the proportion of permutations with the 

15 minimum (maximum) t statistic over all genes less than (greater than) or equal to the 

observed value for the particular gene. A test declaring as significant any genes with say 
p<.50 then guarantees that the chance of any false positives being selected is <50%. 

The t statistics have a tendency to preferentially select genes with very small 
variances within a group. Because of this it may be appropriate to also require minimum 

20 criteria for differences between the group means. After determining the most significant 
genes from the t statistics, those genes with absolute differences between means <100, and 
ratios of means <3 were identified. 

Table 1 is a spreadsheet identifying 50 genes which discriminate normal cycling from 
25 malignant endometrium. 
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The present invention is not limited in scope by the examples provided, since the 
examples are intended as illustrations of various aspects of the invention and other 
functionally equivalent embodiments are within the scope of the invention. Various 
5 modifications of the invention in addition to those shown are described herein will become 
apparent to those skilled in the art for the foregoing description and fall within the scope of 
the appended claims. The advantages and objects of the invention are not necessarily 
encompassed by each embodiment of the invention. 

All references, patents, and patent publications that are recited in this application are 
10 incorporated in their entirety herein by reference. 
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